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(57) Abstract: In accordance with a first aspect of 
the present invention, a digital device is provided wiih 
a data transmitter designed to receive constituting 
elements of a data structure, determine occurence 
frequency of each unique constituting element in the 
data structure, assign a cookie representation to each 
of the unique constituting elements based al least 
in part on the occurence frequencies of tlie unique 
constituting elements, and transmit the data structure 
implicity in a substantively equivalent form that 
allows a receiver of the data structure in the substan- 
tively equivalent form to be able to reconstitute the 
data structure using the occurence frequency based 
cookie representations. In accordance with another 
aspect of the present invention, a digital device is 
provided with a data receiver designed to receive 
unique constituting elements of u dutu structure 
transmitted in a pre -determined manner, infer 
corresponding cookie representations for the received 
unique constituting elements in accordance with their 
manner of transmissions under the pre-determined 
manner of transmission, and receive the constituting 
elements of the data structure in a representative 
form. In one embodiment, the data receiver is further 
designed to reconstitute the constituting clemenLs of 
the data structure, received in the representative form, 
based on the interred cookie representations. 
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Method and Apparatus for Sending and Receiving A Data Structure in a 
Constituting Element Occurrence Freguencv Based Compressed Form 

BACKGROUND OF THE INVENTION 

1. Field of the Invention 

The present invention relates to the fields of data processing. More 
specifically, the present invention relates to the sending and receiving of data 
stnjctures in a bandwidth reduction form. 

2. Background Information 

Recently, with advances in the Internet and web based applications, 
semi-structured data structures, such as Extensible Markup Language (XML) 
data structures, have become an industry standard mechanism to either 
transfer or store data. Semi-structured data stmctures are favored over other 
conventional fixed and/or application specific data structures because of the 
extensibility, transparency, platform-independency and manageability. These 
data structures allow two pieces of software programs that are independently 
developed to communicate with each other. However, transmission of these 
semi-structured data structures has at least two drawbacks, a) the size of the 
data structure having to be transferred and (b) the associated processing cost 
(especially on the receiver side). 

Size: Semi-structured data stmctures, such as XML data structures, are 
typically very redundant when compared to other conventional fixed, 
application specific data structures. Many tag names and attribute names 
must be repeated over and over again. For example, it usually takes 100- 
300% more bytes to represent the same data in XML. In addition, it is very 
common that there are many duplicate attribute values. Consider the example 
"Employees" XML data structure illustrated in Fig. 4a, the tag name 
"Employee" and attribute names "Employee ID" and "Title" are repeated over 
and over again. 
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Processing Cost: Semi-structured data structures, such as XML, are 
also very expensive to parse. Typically, the data sender either builds the data 
structure directly concatenating a number of strings or feeding them into a 
stream, or builds an object hierarchy and then serializes it into a string or 
stream. On the receiver side, the receiver code must then scan the data 
string/stream to sequentially look for space characters to tokenize, and 
compare each tag names and attributes with known keywords. Further, such 
parsing requires a lot of memory, especially if each token is stored as a 
separate string object. 

These drawbacks are especially problematic for smaller devices with 
limited CPU-power and small amount of memory (such as wireless mobile 
phones and palm sized personal digital assistants) with lower data 
transmission speed. In certain applications, such as Nippon Telephone 
Telegraph - DoCoMo's iMode, the operation cost can be significantly higher, as 
the application operator charges for the service on a per-packet basis. 

Thus, a more efficient approach to transmitting such data structures is 
desired. 

SUMMARY OF THE INVENTION 

In accordance with a first aspect of the present invention, a data 
transmitter is designed to receive constituting elements of a data structure, 
determine occurrence frequency of each unique constituting element in the 
data stmcture, assign a cookie representation to each of the unique 
constituting elements based at least in part on the occurrence frequencies of 
the unique constituting elements, and transmit the data structure implicitly in a 
substantively equivalent form that allows a receiver of the data structure in the 
substantively equivalent form to be able to reconstitute the data structure using 
the occurrence frequency based cookie representations. 

In accordance with another aspect of the present invention, a data 
receiver is designed to receive unique constituting elements of a data stmcture 
transmitted in a pre-determined manner, infer conresponding cookie 
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representations for the received unique constituting elements in accordance 
with their manner of transmissions under the pre-determined manner of 
transmission, and receive the constituting elements of the data stmcture in a 
representative form. In one embodiment, the data reeiver is further designed 
to reconstitute the constituting elements of the data structure, received in the 
representative form, based on the Inferred cookie representations. 

In one embodiment, the data stmcture is a XML data structure. The 
constituting elements include tag names, attribute names, and attritbute 
values. 

In one embodiment, a digital device is provided with the data transmitter. 
In another embodiment, a digital device is provided with the data receiver. In 
yet another embodiment, a digital device is provided with both. 

In one embodiment, the digital device is a wireless mobile phone. In 
another, the digital device is a palm sized personal digital assistant, a notebook 
sized computer, a desktop computer, a set top box, or a server. 

BRIEF DESCRIPTION OF DRAWINGS 

The present invention will be described by way of exemplary 
embodiments, but not limitations, illustrated in the accompanying drawings in 
which like references denote similar elements, and in which: 

Figure 1 illustrates an overview of the present invention, in accordance 
with one embodiment; 

Figures 2a-2b illustrate a method view of the present invention, in 
accordance with one embodiment; 

Figures 3a-3c illustrate example data structures suitable for use to 
practice the present invention, in accordance with one embodiment; 

Figures 4a-4g illustrate an example application of the present invention 
to the transmission of an example XML data structure; and 

Figure 5 illustrates an architectural view of an example computing 
device, suitable for practicing the present invention, in accordance with one 
embodiment. 

3 
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DETAILED DESCRIPTION OF THE INVENTION 

In the following description, various aspects of the present invention will 
be described. However, it will be apparent to those skilled in the art that the 
present invention may be practiced with only some or all aspects of the present 
invention. For purposes of explanation, specific numbers, materials and 
configurations are set forth in order to provide a thorough understanding of the 
present invention. However, it will also be apparent to one skilled in the art that 
the present invention may be practiced without the specific details. In other 
instances, well known features are omitted or simplified in order not to obscure 
the present invention. 

Parts of the description will be presented using terms such as data 
structures, tag names, attribute names, and so forth, commonly employed by 
those skilled in the art to convey the substance of their work to others skilled in 
the art. Parts of the description will be presented in tenms of operations 
performed by a computing device, using terms such as receiving, determining, 
transmitting, and so forth. As well understood by those skilled in the art, these 
quantities and operations take the form of electrical, magnetic, or optical signals 
capable of being stored, transferred, combined, and othenA^lse manipulated 
through mechanical and electrical components of a digital system. The term 
digital system includes general purpose as well as special purpose computing 
machines, systems, and the like, that are standalone, adjunct or embedded. 

Various operations will be described in tum in a manner that is most 
helpful in understanding the present invention, however, the order of description 
should not be construed as to imply that these operations are necessarily order 
dependent. Furthermore, the phrase "in one embodiment" will be used 
repeatedly, however the phrase does not necessarily refer to the same 
embodiment, although it may. 



4 



wo 02/39592 



PCT/USOl/22488 



Overview 

Referring now to Figure 1, wherein a block diagram illustrating an 
overview of the present invention, in accordance with one embodiment is 
shown. As illustrated, in accordance with one aspect of the present invention, 
data sender system 102 is advantageously provided with data transmitter 108 
of the present invention, to assist a data sending application, such as data 
sender 104, to transmit semi-structured data structures, such as XML data 
stnjctures, as represented by data structures 106, in a more efficient, compact, 
and bandwidth reduced manner. As will be described in more detail below, 
data transmitter 108 effectuates transmission of data stmctures 106 in the 
desired manner, by transmitting occurrence frequency based cookie 
representations of the "tokens", i.e. data elements, of data stmctures 106 
instead. For the illustrated embodiment, the novel transmission of the 
occun-ence frequency based cookie reprensentations are perfomied, 
employinig dictionary 110 and array 112. As will be described in more detail 
below, dictionary 110 is employed to store the occurrence frequency based 
cookie representations for encoding the "tokens", whereas array 112 is used to 
store the encoded 'lokens", i.e. their cookie representations. 

In accordance with another aspect of the present invention, data 
receiver system 114 is advantageously provided with complementarilty 
equipped data receiver 116 to assist the ultimate data recipient 118 in receiving 
data structure 106 transmitted in the above described efficient manner. For the 
illustrated embodiment, data receiver 116 effectuates the assistance employing 
dictionary 110', which as will be described in more detail beliow, is provided by 
data transmitter 108. 

Except for the respective provisions of data transmitter 108 and data 
receiver 116 to sender system 102 and receiver system 114, sender system 
102 and receiver system 114 are othenvise intended to represent a broad 
range of digital devices known in the art, including but are not limited to, 
wireless mobile phones, palm sized personal digital assistants, notebook sized 
computers, desktop computers, set-top boxes, servers, and the like. Of 

course, sender system 102 and receiver system 114 may also be further 
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provided with data receiver 116 and data transmitter 108 respectively, allowing 
these systems to function in the role of a data sender at one point in time, and 
in the role of a data receiver at another point in time. For these embodiments, 
of course data transmitter 108 and data receiver 116 may be provided as a 
combined unit or component, i.e. a data tranceiver, having both the 
transmission as well as the reception capabilities of the present Invention. On 
the other hand, in alternate embodiments, data sender 104 and data 
transmitter 108 may be disposed in different systems. Similarly, data receiver 
116 and ultimate data recipient 118 may also be disposed in different systems. 

Further, sender system 102 and receiver system 114 may be coupled to 
each other via any one of a number of wireless or wireline based 
communication interfaces, using any one of a number of communication 
protocols. For example, the communication interface may be a wireless 
medium, using the TCP/IP communication protocol, signaled in accordance 
with the GSM, CDPD, CDMA or WCDMA signalling protocol. Alternatively, the 
communication may be a wireline based medium, again using the TCP/IP 
communication protocol, signaled in accordance with the Ethemet signalling 
protocol. In general, as those skilled in the art will appreciate, the present 
invention may be practiced in any communication/signal protocols on any 
communication medium. 

Similarly, while for ease of understanding, the present invention will be 
described referencing XML data structures and examples expressed in XML, 
those skilled in the art would appreciate that the present invention may also be 
practiced on other data strcutures, including but are not limited to HTML or 
WML encoded contents. 

Method 

Referring now to Figures 2a-2b, wherein two block diagrams illustrating 
the novel data sending and receiving method of the present invention in further 
detail, in accordance with one embodiment, are shown. As illustrated in Fig. 
2a, at block 202, data sender 104 "transparently" sends constituting elements 
of data structure 106 (such as tag names, attribute names, and attribute 
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values, in the case of an XML stmcture) in plain text, as in the prior art. That 

is, legacy data sender 104 may continue to send data as in the prior art without 

having to make any adjustments to its operation, nor having to be congnizant 

of the practice of the present invention. However, in alternate embodiments, 

data sender 104 who's cognizant of the present invention, may further take 

advantage by sending the data elements of data strcuture 106 in token form. 

In accordance with the present invention, the data elements are received by 

data transmitter 108 and turn into token form if received in the plain text form. 

Data transmitter 108 would parse the received data strcuture 106 to "tokenize" 

its data elements, using any one of a number of parsing techniques known in 

the art. Using example "Employees" XML data structure 400 illustrated in Fig. 

4a as an example, as the constituting elements of example structure 400, i.e. 

"<", "Employees", ">", and so forth, are sent "transparently" by data sender 104, 

data transmitter 108 receives the constituting elements as 'lokens", as 

illustrated in Fig. 4b. 

^ Refening back to Fig. 2a-2b, at block 204, data transmitter 108 encodes 

the "tokens" with cookie representations. More importantly, the cookie 

representations are functionally dependent on the occurrence frequencies of 

the unique "tokens" in data stmcture 106. Using the example "Employees" 

XML data structure 400 illustrated in Fig. 4a as an example again, the 

constituting elements are encoded as illustrated in Fig, 4f, using the 

occurrence frequency based cookie representations of Fig. 4e. For example, 

the token ">" is encoded with the numeric cookie representation of "1", as the 

token ">" is the most frequently occurred token, among the tokens of example 

data structure 400 (8 times), the token "=" is encoded with the numeric cookie 

representation of "2", as the token "=" is the next most frequently occurred 

token, among the tokens of example data structure 400 (6 times), and so forth. 

[Ties are broken arbitrarily.] In one embodiment, the encoding is a multi-step 

process, to be described in more detail below. 

Thus, under this embodiment of the novel occunrence frequency based 

encoding scheme of the present invention, the most frequently occurred token 

is encoded with a numeric cookie representation having the lowest numeric 
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value (relative to other numeric cookie representations employed for the data 
structure being transmitted), the next most frequently occurred token is 
encoded with a numeric cookie representation having the next lowest numeric 
value, and so forth. 

As those skilled in the art would appreciate, under this scheme, the first 
127 most frequently occurred unique tokens may be transmitted employing one 
byte of bandwidth for each token, that is with each token as a datum with a size 
of one byte, whereas the next 32,640 most frequently occurred unique tokens 
may be transmitted employing two bytes of bandwidth for each token, that is 
with each token as a datum with a size of two bytes. The two formats may be 
differentiated e.g. using the most significant bit. As a result, a data structure 
may be advantageously transmitted with further reduction in bandwidth 
required, as the more frequently occurred tokens are transmitted with one byte 
encodings, while only the less frequently occurred tokens are transmitted with 
two byte encodings. 

Referring back again to Fig. 2a-2b, at block 206, data transmitter 108 
transmits the unique "tokens" and "conveys" their cookie representations to 
data receiver 116. In one embodiment, the cookie representations of the 
'tokens" are implicitly conveyed. That is, the cookie representation are not 
explicitly transmitted. Instead, the unique "tokens" are transmitted in a pre- 
determined manner, and data receiver 116 infers the cookie representations 
from the manner the unique "tokens" are transmitted under the predetermined 
manner. Again referring to the example encoding illustrated in Fig- 4e, the 
tokens ">", "Employees", and so forth, are transmitted in order of their 
occurrence frequencies, accordingly their cookie representations, i.e. "1", "2", 
and so forth, may be inferred from the transmission positions of the tokens. 

Thereafer, at block 208, data transmitter 108 transmits the "tokens" in 
their encoded representative form. In one embodiment, data transmitter 108 
transmits the tokens (implicitly conveying their encodings), and the encoded 
representations as one contiguous string or stream (to be described more fully 
below). At block 210, upon receipt of^the list of unique tokens (and their 
encodings), and the encoded representations, data receiver 116 reconstitutes 

8 
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the original data structure, i.e. regenerating the original data elements based 
on the received encoding representations and the unique tokens (and their 
corresponding encoding representations), for ultimate data recipient 118. As a 
result, the amount of processing required on the receiver side to accept the 
transmitted data structure is also significantly reduced. Further, by remapping 
the tokens back to the original data elements, the method may be made 
transparent to legacy data receivers. However, in altemate embodiments, data 
recipients 118 cognizant of data receivers 116 may further take advantage of 
the present invention, and reduces its storage employed to store received data 
strcutures by having data reciever 116 provides the received data structure in 
the token form, without reconstituting the original data elements. 

Figure 2b illustrates the encoding operation of block 204 in further 
details, in accordance with one embodiment. As illustrated, at blocks 222 and 
224. data transmitter 108 first encodes the tokens with an initial encoding as 
the tokens are received/identified, and stores the received/identified tokens in 
their representative form. Additionally, data transmitter 108 tracks each of the 
unique tokens encountered, its initial encoding, and more importantly, the 
occurrence frequency of each of the unqiue tokens. For the illustrated 
embodiment, the intial encoding is simply the order the unique tokens are 
encountered. For example, for the example "Employee" XML data stmcture 
400 of Fig. 4a, the initial encoding employed is as illustrated in Fig. 4c. That 
is, token "<" is encoded with the numeric cookie representation of "0", as it is 
encountered first, token "Employees" is encoded with the numeric cookie 
represenation of "1", as it is encountered next, and so forth. Thus, example . 
"Employee" XML data staicture 400 may be stored in a representative form in 
array 430a (corresponding to array 112 of Fig. 1) as illustrated in Fig. 4d. 

Thus, upon receipt of all tokens, i.e. data elements of the data structure 
being transmitted, the occurrence frequncies of the unique tokens of the data 
structure would be established. For the example XML data strcuture 400, it 
would have established that token "<" occurs 4 times, token "Employees" 
occurs once, token ">" occurs 8 times (the most frequent), and so forth, as 
illustrated in Fig. 4c. 

9 
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Thereafter, at blocks 226 and 228, data transmitter 108 replaces the 
initial cookie representations with replacement cookie representations that are 
functionally dependent on the occurrence frequency of the unique tokens, and 
the stored 'tokens" in their representative fonn are re-mapped to new 
representations. For example, the replacement cookie representation of "1 " is 
assigned to replace the initial cookie representation of "2" for the most 
frequently occurred token ">",the replacement cookie representation of "2" is 
assigned to replace the initial cookie representation of "6" for the second most 
frequently occurred token "=", and so forth. Correspondingly, the stored tokens 
in their initial representations (Fig. 4d) are re-mapped to the replacement 
representations (Fig. 4f). The remapping e.g. may be performed with the 
assistance of a remapping vector (not shown), which is known in the art. 

Thus, it can be seen that the encoding or compression operations of the 
present invention may be performed in a relatively straight fonA^ard manner, 
with relative low memory and processing requirements. As a result, the 
amount of memory and proecssing required on the sender side to "compress" 
the data elements for transmission (to achieve the desired bandwidth 
consumption reduction), under the present invention, is also advanageously 
smaller than other compression techniques known in the art, such as "Zip". 

Data Structures 

Figures 3a-3c illustrate a number of example data structures suitable 
for use to practice the present invention, in accordance with one embodiment. 
Shown in Figure 3a is example table 300 having at least three columns 302- 
306, suitable for use by data transmitter 108 to store the cookie 
representations (initial as well as final for the earlier described two steps 
embodiment), the represented tokens, and their occurrence frequencies. An 
abridged version of example table 300, without column 306 may be used by 
data receiver 116 to store the cookie representations, and the represented 
unique tokens. Shown in Figure 3b is example array 310 having a number 
storage slots suitable for use by data transmitter 108 to stored the encoded 
representations (cO, c1 , c2 etc.) of the tokens of a data structure being 

10 
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transmitted. Shown in Figure 3c is example string or stream 320 having two 
sections 322 and 326, separated by delimiters 324a-324b, suitable for use by 
data transmitter 108 to transmit the unique tokens (and implicitly convey their 
encoding representations), and the encoded representaions of the tokens of a 
data structure being transmitted. For the illustrated embodiment, first section 
322 is employed to transmit the unique tokens (and implicitly convey their 
encoding representations). Each unique token is preceded by the token size. 
For example, the token "<" is preceded by the token size value of "0x01". the 
token "</" is preceded by the token size "0x02", and so forth (as illustrated in 
Fig. 4g). The encoding representation for the token "<" is "1", as implied by the 
fact that the token is transmitted in the first transmission position, the encoding 
representation for the token "</" is "3", as implied by the fact that the token is 
transmitted in the third transmission position, and forth. Referring back to Fig. 
3c, as illustrated, second section 326 is employed to transmit the encoded 
representations of the tokens of the data structure being transmitted. 

Example Digital Device 

Figure 5 illustrates an example computing device suitable for use to 
practice the present invention, in accordance with one embodiment. As shown, 
computing device 500 includes general purpose processor 502, digital signal 
processor (DSP) 504, and system memory 506. Additionally, device or system 
500 includes GPIO 508 (for interfacing with I/O devices such as keyboard, 
cursor control and so forth) and communication interfaces 510 (such as 
network interface cards, modems, wireless transceivers and so forth). The 
elements are coupled to each other via system bus 512, which represents one 
or more buses. In the case of multiple buses, they are bridged by one or more 
bus bridges (not shown). More importantly, device or system 500 is provided 
with data transceiver 514 incorporated with the teachings of the present 
invention to send and receive data stmctures in the above described more 
efficient constituting element occurrence frequency based compression form. 

The number and type of processor, the size of memory, as well as the 

number of other elements employed are typically dependent on the intended 
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usage of example computing device 500. For example, if used as a wireless 
mobile telephone or a palm sized personal digital assistant, probably a 
relatively lower perfonmance processor and smaller amount of memory are 
used. On the other hand, if used as a notebook computer or a set top box, 
probably a relatively higher performance processor and more amount of 
memory are used, and may be even with the additional employment of mass 
storage devices. If used as a desktop computer or a server, probably even 
multiple high performance processors are employed, but may be without the 
employment of DSP 504 instead. 

Each of these elements performs its conventional functions known in the 
art. In particular, system memory 504 is employed to store a copy of the 
programming instructions implementing data transceiver 514. Except for its 
use to host novel data transceiver 514 incorporated with the transmit and 
receive teachings of the present invention, the constitution of these elements 
502-512 are known, and accordingly will not be further described. 

Conclusion and Epilogue 

Accordingly, a method and apparatus for sending and receiving a data 
structure in a constituting element occurrence frequency based compressed 
form has been described. As mentioned earlier, the present invention 
significantly reduces the number of bytes required to be transmitted, as well as 
the amount of memory and the amount of processing required on the sender 
and the receiver systems. 

While the present invention has been described in terms of the above 
illustrated embodiments, those skilled in the art will recognize that the invention 
is not limited to the embodiments described. The present invention can be 
practiced with modification and alteration within the spirit and scope of the 
appended claims. Thus, the description is to be regarded as illustrative instead 
of restrictive on the present invention. 
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CLAIMS 

What is claimed is: 

1 . A method comprising: 

receiving a plurality of constituting elements of a data structure; 

detemiining occurrence frequency of each unique constituting element 
in said data structure; 

assigning a cookie representation to each of said unique constituting 
elements based at least in part on the occurrence frequencies of said unique 
constituting elements; and 

transmitting said data structure implicitly in a substantively equivalent 
form that allows a receiver of said data structure in said substantively 
equivalent form to be able to reconstitute the data structure using said 
occurrence frequency based cookie representations. 

2. The method of claim 1 , wherein said determining and assigning 
comprises assigning an initial cookie representation to each unique constituting 
element as the constituting elements are received, and tracking occurrence 
frequencies of the unique constituting elements, and upon receipt of all 
constituting elements of the data structure, re-assigning a final cookie 
representation for each of the unique constituting elements based on the 
occurrence frequencies of the unique constituting elements. 

3. The method of claim 2, wherein the method furt:her comprises ordering 
said unique constituting elements based on their occurrence frequencies. 

4. The method of claim 2, wherein the method further comprises storing 
said constituting elements of the data structure as they are received, using said 
initial cookie representations, and subsequently replacing the stored initial 
cookie representations with the final cookie representations, and said 
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transmitting comprises transmitting said constituting elements of said data 
structure using said final cookie representations. 

5. The method of claim 4. wherein said transmitting further comprises 
transmitting a list of said unique constituting elements in the order of their 
occurrence frequencies to allow the receiver to infer the corresponding final 
cookie representations of the unique constituting elements. 

6. The method of claim 1 . wherein the cookie representations are numeric 
in form, with the cookie representations of the 128 most frequently occun-ed 
unique constituting elements having a size of one byte each, and the cookie 
representations of the next 32,640 most frequently occurred unique constituting 
elements having a size of two bytes each. 

7. The method of claim 1 , wherein said data structure is an XML data 
structure, and said constituting elements comprise tag names, attribute names 
and attribute values. 

8. A method comprising: 

receiving a plurality of unique constituting elements of a data structure 
transmitted in a pre-detemiined manner; 

infemng a plurality of con-esponding cookie representations for the 
received unique constituting elements In accordance with their manner of 
transmissions under the pre-detemnined manner of transmission; and 

receiving the constituting elements of the data structure in a 
representative form. 

9. The method of claim 8, wherein said infemng comprises inferring the 
plurality of corresponding cookie representations based on the order the 
unique constituting elements are transmitted. 



14 



wo 02/39592 



PCT/USOl/22488 



10. The method of claim 9, wherein said inferring comprises infening a 
unique one-byte numeric representation for each of the first 128 unique 
constituting elements transmitted, and a unique two-bytes representation for 
each of the next 32,460 unique constituting elements transmitted. 

1 1 . The method of claim 8, wherein the method further comprises 
reconstituting the constituting elements of the data structure, received in said 
representative form, based on the inferred cookie representations. 

12. The method of claim 8, wherein said data structure is an XML data 
structure, and said constituting elements comprises tag names, attribute 
names and attribute values. 

13. An apparatus comprising: 

storage medium having stored therein a plurality of programming 
instructions designed to receive a plurality of constituting elements of a data 
structure, determine occurrence frequency of each unique constituting element 
in said data structure, assign a cookie representation to each of said unique 
constituting elements based at least in part on the occurrence frequencies of 
said unique constituting elements, and transmit said data structure Implicitly in 
a substantively equivalent form that allows a receiver of said data structure in 
said substantively equivalent form to be able to reconstitute the data structure 
using said occurrence frequency based cookie representations; and 

at least one processor coupled to the storage medium to execute the 
programming instructions. 

14. The apparatus of claim 13, wherein said programming instructions are 
designed to perform said determining and assigning by assigning an initial 
cookie representation to each unique constituting element as the constituting 
elements are received, and tracking occurrence frequencies of the unique 
constituting elements, and upon receipt of all constituting elements of the data 
structure, re-assigning a final cookie representation for each of the unique 
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constituting elements based on the occunrence frequencies of the unique 
constituting elements. 

1 5. The apparatus of claim 1 4, wherein the programming instaictions are 
further designed to order said unique constituting elements based on their 
occurrence frequencies. 

16. The apparatus of claim 14. wherein the programming instructions are 
further designed to store said constituting elements of the data structure as 
they are received, using said initial cookie representations, and subsequently 
replace the stored initial cookie representations with the final cookie 
representations, and said programming instmctions perform said transmitting 
by transmitting said constituting elements of said data structure using said final 
cookie representations. 

17. The apparatus of claim 16, wherein said programming instructions are 
further designed to transmit a list of said unique constituting elements in the 
order of their occunrence frequencies to allow the receiver to infer the 
corresponding final cookie representations of the unique constituting elements. 

18. The apparatus of claim 13, wherein the programming instructions are 
deigned to employ cookie representations in numeric form, with the cookie 
representations of the 128 most frequently occurred unique constituting 
elements having a size of one byte each, and the cookie representations of the 
next 32,640 most frequently occurred unique constituting elements having a 
size of two bytes each. 

19. The apparatus of claim 13, wherein said programming instaictions are 
designed to perform said receive, determine, assign and transmit for an XML 
data structure, said constituting elements comprising tag names, attribute 
names and attribute values. 
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20. The apparatus of claim 13, wherein said apparatus is a selected one of 
a wireless mobile phone, a palm sized personal digital assistant, a notebook 
sized computer, a desktop computer, a set top box and a server. 

21 . An apparatus comprising: 

storage medium having stored therein a plurality of programming 
instmctions designed to receive a plurality of unique constituting elements of a 
data structure transmitted in a pre-determined manner, infer a plurality of 
corresponding cookie representations for the received unique constituting 
elements in accordance with their manner of transmissions under the pre- 
determined manner of transmission, and receive the constituting elements of 
the data structure in a representative form; and 

at least one processor coupled to the storage medium to execute the 
programming instructions. 

22. The apparatus of claim 21 , wherein said programming instructions are 
designed to infer the plurality of corresponding cookie representations based 
on the order the unique constituting elements are transmitted. 

23. The apparatus of claim 22, wherein said programming instructions are 
designed to infer a unique one-byte numeric representation for each of the first 
128 unique constituting elements transmitted, and a unique two-bytes 
representation for each of the next 32.460 unique constituting elements 
transmitted. 

24. The apparatus of claim 21 , wherein said programming instructions are 
further designed to reconstitute the constituting elements of the data structure, 
received in said representative form, based on the inferred cookie 
representations. 

25. The apparatus of claim 21 , wherein said programming instructions are 
designed to perform said receive, infer, receive, and re-constitute for a XML 
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data structure, said constituting elements comprising tag names, attribute 
names and attribute values. 

26. The apparatus of claim 21 , wherein said apparatus is a selected one of 
a wireless mobile phone, a palm sized personal digital assistant, a notebook 
sized computer, a desktop computer, a set top box and a server. 
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